AITopics | Kyzyl

Collaborating Authors

Kyzyl

Augmenting Human Cognition through Everyday AR

arXiv.org Artificial IntelligenceMay-7-2025

As spatial computing and multimodal LLMs mature, AR is tending to become an intuitive "thinking tool," embedding semantic and context-aware intelligence directly into everyday environments. This paper explores how always-on AR can seamlessly bridge digital cognition and physical affordances, enabling proactive, context-sensitive interactions that enhance human task performance and understanding.

artificial intelligence, large language model, natural language, (15 more...)

arXiv.org Artificial Intelligence

2505.03492

Country:

North America > United States > New York > New York County > New York City (0.07)
Asia > Russia > Siberian Federal District > Republic of Tuva > Kyzyl (0.05)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.36)

Add feedback

Peek into the `White-Box': A Field Study on Bystander Engagement with Urban Robot Uncertainty

Yu, Xinyan, Hoggenmueller, Marius, Tran, Tram Thi Minh, Wang, Yiyuan, Zhang, Qiuming, Tomitsch, Martin

arXiv.org Artificial IntelligenceFeb-28-2025

Uncertainty inherently exists in the autonomous decision-making process of robots. Involving humans in resolving this uncertainty not only helps robots mitigate it but is also crucial for improving human-robot interactions. However, in public urban spaces filled with unpredictability, robots often face heightened uncertainty without direct human collaborators. This study investigates how robots can engage bystanders for assistance in public spaces when encountering uncertainty and examines how these interactions impact bystanders' perceptions and attitudes towards robots. We designed and tested a speculative `peephole' concept that engages bystanders in resolving urban robot uncertainty. Our design is guided by considerations of non-intrusiveness and eliciting initiative in an implicit manner, considering bystanders' unique role as non-obligated participants in relation to urban robots. Drawing from field study findings, we highlight the potential of involving bystanders to mitigate urban robots' technological imperfections to both address operational challenges and foster public acceptance of urban robots. Furthermore, we offer design implications to encourage bystanders' involvement in mitigating the imperfections.

bystander, computing machinery, robot, (11 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3706598.3713790

2503.00337

Country:

Europe > Austria > Vienna (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
(32 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation (0.68)
Health & Medicine (0.47)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.67)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.47)

Add feedback

Co-Designing Augmented Reality Tools for High-Stakes Clinical Teamwork

Taylor, Angelique, Tanjim, Tauhid, Cao, Huajie, Nicoly, Jalynn Blu, Segal, Jonathan I., George, Jonathan St., Kim, Soyon, Ching, Kevin, Ortega, Francisco R., Lee, Hee Rin

arXiv.org Artificial IntelligenceFeb-24-2025

How might healthcare workers (HCWs) leverage augmented reality head-mounted displays (AR-HMDs) to enhance teamwork? Although AR-HMDs have shown immense promise in supporting teamwork in healthcare settings, design for Emergency Department (ER) teams has received little attention. The ER presents unique challenges, including procedural recall, medical errors, and communication gaps. To address this gap, we engaged in a participatory design study with healthcare workers to gain a deep understanding of the potential for AR-HMDs to facilitate teamwork during ER procedures. Our results reveal that AR-HMDs can be used as an information-sharing and information-retrieval system to bridge knowledge gaps, and concerns about integrating AR-HMDs in ER workflows. We contribute design recommendations for seven role-based AR-HMD application scenarios involving HCWs with various expertise, working across multiple medical tasks. We hope our research inspires designers to embark on the development of new AR-HMD applications for high-stakes, team environments.

ar-hmd, information, participant, (14 more...)

arXiv.org Artificial Intelligence

2502.17295

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Michigan > Ingham County > Lansing (0.04)
(13 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology > Telehealth (1.00)
Health & Medicine > Health Care Providers & Services (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols

Chung, John Joon Young, Roemmele, Melissa, Kreminski, Max

arXiv.org Artificial IntelligenceJan-22-2025

We introduce Toyteller, an AI-powered storytelling system where users generate a mix of story text and visuals by directly manipulating character symbols like they are toy-playing. Anthropomorphized symbol motions can convey rich and nuanced social interactions; Toyteller leverages these motions (1) to let users steer story text generation and (2) as a visual output format that accompanies story text. We enabled motion-steered text generation and text-steered motion generation by mapping motions and text onto a shared semantic space so that large language models and motion generation models can use it as a translational layer. Technical evaluations showed that Toyteller outperforms a competitive baseline, GPT-4o. Our user study identified that toy-playing helps express intentions difficult to verbalize. However, only motions could not express all user intentions, suggesting combining it with other modalities like language. We discuss the design space of toy-playing interactions and implications for technical HCI research on human-AI interaction.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3706598.3713435

2501.13284

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.05)
(22 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.92)

Industry: Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Augmented Conversation with Embedded Speech-Driven On-the-Fly Referencing in AR

Jadon, Shivesh, Faridan, Mehrad, Mah, Edward, Vaish, Rajan, Willett, Wesley, Suzuki, Ryo

arXiv.org Artificial IntelligenceMay-28-2024

This paper introduces the concept of augmented conversation, which aims to support co-located in-person conversations via embedded speech-driven on-the-fly referencing in augmented reality (AR). Today computing technologies like smartphones allow quick access to a variety of references during the conversation. However, these tools often create distractions, reducing eye contact and forcing users to focus their attention on phone screens and manually enter keywords to access relevant information. In contrast, AR-based on-the-fly referencing provides relevant visual references in real-time, based on keywords extracted automatically from the spoken conversation. By embedding these visual references in AR around the conversation partner, augmented conversation reduces distraction and friction, allowing users to maintain eye contact and supporting more natural social interactions. To demonstrate this concept, we developed \system, a Hololens-based interface that leverages real-time speech recognition, natural language processing and gaze-based interactions for on-the-fly embedded visual referencing. In this paper, we explore the design space of visual referencing for conversations, and describe our our implementation -- building on seven design guidelines identified through a user-centered design process. An initial user study confirms that our system decreases distraction and friction in conversations compared to smartphone searches, while providing highly useful and relevant information.

information, interaction, participant, (12 more...)

arXiv.org Artificial Intelligence

2405.18537

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.29)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > France (0.04)
(5 more...)

Genre: Questionnaire & Opinion Survey (1.00)

Industry:

Information Technology (1.00)
Health & Medicine (0.95)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

RealitySummary: On-Demand Mixed Reality Document Enhancement using Large Language Models

Gunturu, Aditya, Jadon, Shivesh, Zhang, Nandi, Thundathil, Jarin, Willett, Wesley, Suzuki, Ryo

arXiv.org Artificial IntelligenceMay-28-2024

We introduce RealitySummary, a mixed reality reading assistant that can enhance any printed or digital document using on-demand text extraction, summarization, and augmentation. While augmented reading tools promise to enhance physical reading experiences with overlaid digital content, prior systems have typically required pre-processed documents, which limits their generalizability and real-world use cases. In this paper, we explore on-demand document augmentation by leveraging large language models. To understand generalizable techniques for diverse documents, we first conducted an exploratory design study which identified five categories of document enhancements (summarization, augmentation, navigation, comparison, and extraction). Based on this, we developed a proof-of-concept system that can automatically extract and summarize text using Google Cloud OCR and GPT-4, then embed information around documents using a Microsoft Hololens 2 and Apple Vision Pro. We demonstrate real-time examples of six specific document augmentations: 1) summaries, 2) comparison tables, 3) timelines, 4) keyword lists, 5) summary highlighting, and 6) information cards. Results from a usability study (N=12) and in-the-wild study (N=11) highlight the potential benefits of on-demand MR document enhancement and opportunities for future research.

enhancement, participant, realitysummary, (14 more...)

arXiv.org Artificial Intelligence

2405.1862

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.29)
Europe > Ukraine (0.14)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report (0.82)
Overview (0.67)
Instructional Material > Course Syllabus & Notes (0.48)

Industry:

Education (1.00)
Information Technology > Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Augmented Physics: A Machine Learning-Powered Tool for Creating Interactive Physics Simulations from Static Diagrams

Gunturu, Aditya, Wen, Yi, Thundathil, Jarin, Zhang, Nandi, Kazi, Rubaiat Habib, Suzuki, Ryo

arXiv.org Artificial IntelligenceMay-28-2024

We introduce Augmented Physics, a machine learning-powered tool designed for creating interactive physics simulations from static textbook diagrams. Leveraging computer vision techniques, such as Segment Anything and OpenCV, our web-based system enables users to semi-automatically extract diagrams from physics textbooks and then generate interactive simulations based on the extracted content. These interactive diagrams are seamlessly integrated into scanned textbook pages, facilitating interactive and personalized learning experiences across various physics concepts, including gravity, optics, circuits, and kinematics. Drawing on an elicitation study with seven physics instructors, we explore four key augmentation techniques: 1) augmented experiments, 2) animated diagrams, 3) bi-directional manipulatives, and 4) parameter visualization. We evaluate our system through technical evaluation, a usability study (N=12), and expert interviews (N=12). The study findings suggest that our system can facilitate more engaging and personalized learning experiences in physics education.

diagram, proceedings, simulation, (13 more...)

arXiv.org Artificial Intelligence

2405.18614

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.15)
North America > United States > New York > New York County > New York City (0.04)
Asia > China > Hong Kong (0.04)
(6 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Instructional Material (1.00)
Personal > Interview (0.48)
Research Report > New Finding (0.34)

Industry:

Education > Educational Setting (1.00)
Education > Curriculum > Subject-Specific Education (0.90)
Education > Educational Technology > Educational Software > Computer Based Training (0.86)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Characterizing and modeling harms from interactions with design patterns in AI interfaces

Ibrahim, Lujain, Rocher, Luc, Valdivia, Ana

arXiv.org Artificial IntelligenceMay-20-2024

The proliferation of applications using artificial intelligence (AI) systems has led to a growing number of users interacting with these systems through sophisticated interfaces. Human-computer interaction research has long shown that interfaces shape both user behavior and user perception of technical capabilities and risks. Yet, practitioners and researchers evaluating the social and ethical risks of AI systems tend to overlook the impact of anthropomorphic, deceptive, and immersive interfaces on human-AI interactions. Here, we argue that design features of interfaces with adaptive AI systems can have cascading impacts, driven by feedback loops, which extend beyond those previously considered. We first conduct a scoping review of AI interface designs and their negative impact to extract salient themes of potentially harmful design patterns in AI interfaces. Then, we propose Design-Enhanced Control of AI systems (DECAI), a conceptual model to structure and facilitate impact assessments of AI interface designs. DECAI draws on principles from control systems theory -- a theory for the analysis and design of dynamic physical systems -- to dissect the role of the interface in human-AI systems. Through two case studies on recommendation systems and conversational language model systems, we show how DECAI can be used to evaluate AI interface designs.

affordance, interface, proceedings, (17 more...)

arXiv.org Artificial Intelligence

2404.1137

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > Virginia (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Information Technology > Services (1.00)
Government (1.00)
Law (0.93)
(2 more...)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(2 more...)

Add feedback

Augmented Object Intelligence: Making the Analog World Interactable with XR-Objects

Dogan, Mustafa Doga, Gonzalez, Eric J., Colaco, Andrea, Ahuja, Karan, Du, Ruofei, Lee, Johnny, Gonzalez-Franco, Mar, Kim, David

arXiv.org Artificial IntelligenceApr-22-2024

Seamless integration of physical objects as interactive digital entities remains a challenge for spatial computing. This paper introduces Augmented Object Intelligence (AOI), a novel XR interaction paradigm designed to blur the lines between digital and physical by equipping real-world objects with the ability to interact as if they were digital, where every object has the potential to serve as a portal to vast digital functionalities. Our approach utilizes object segmentation and classification, combined with the power of Multimodal Large Language Models (MLLMs), to facilitate these interactions. We implement the AOI concept in the form of XR-Objects, an open-source prototype system that provides a platform for users to engage with their physical environment in rich and contextually relevant ways. This system enables analog objects to not only convey information but also to initiate digital actions, such as querying for details or executing tasks. Our contributions are threefold: (1) we define the AOI concept and detail its advantages over traditional AI assistants, (2) detail the XR-Objects system's open-source design and implementation, and (3) show its versatility through a variety of use cases and a user study.

computing machinery, interaction, xr-object, (9 more...)

arXiv.org Artificial Intelligence

2404.13274

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.07)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(12 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)

Industry:

Health & Medicine > Consumer Health (0.69)
Education (0.68)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Add feedback

Swarm Body: Embodied Swarm Robots

Ichihashi, Sosuke, Kuroki, So, Nishimura, Mai, Kasaura, Kazumi, Hiraki, Takefumi, Tanaka, Kazutoshi, Yoshida, Shigeo

arXiv.org Artificial IntelligenceFeb-29-2024

The human brain's plasticity allows for the integration of artificial body parts into the human body. Leveraging this, embodied systems realize intuitive interactions with the environment. We introduce a novel concept: embodied swarm robots. Swarm robots constitute a collective of robots working in harmony to achieve a common objective, in our case, serving as functional body parts. Embodied swarm robots can dynamically alter their shape, density, and the correspondences between body parts and individual robots. We contribute an investigation of the influence on embodiment of swarm robot-specific factors derived from these characteristics, focusing on a hand. Our paper is the first to examine these factors through virtual reality (VR) and real-world robot studies to provide essential design considerations and applications of embodied swarm robots. Through quantitative and qualitative analysis, we identified a system configuration to achieve the embodiment of swarm robots.

embodiment, robot, swarm robot, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3613904.3642870

2402.1583

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > United Kingdom > Scotland > City of Glasgow > Glasgow (0.14)
North America > United States > New York > New York County > New York City (0.06)
(15 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback